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Abstract 

We investigate the position of the Buchen-Kelly density [TU] in the family of entropy maximising 
densities from _21_ which all match European call option prices for a given maturity observed in 
the market. Using the Legendre transform which links the entropy function and the cumulant 
generating function, we show that it is both the unique continuous density in this family and the one 
with the greatest entropy. We present a fast root-finding algorithm that can be used to calculate the 
Buchen-Kelly density, and give upper boundaries for three different discrepancies that can be used as 
convergence criteria. Given the call prices, arbitrage-free digital prices at the same strikes can only 
move within upper and lower boundaries given by left and right call spreads. As the number of call 
prices increases, these bounds become tighter, and we give two examples where the densities converge 
to the Buchen-Kelly density in the sense of relative entropy when we use centered call spreads as 
proxies for digital prices. As pointed out by Breeden and Litzenberger in the limit a continuous 
set of call prices completely determines the density. 
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1 Introduction 

An important problem in derivatives valuation consists in finding suitable probability densities for the 
underlying asset such that observed market quotes are matched under risk-neutral pricing. In practice, 
many schemes rely on somehow choosing an interpolation method and hoping that the difference between 
one choice and another is not too big. However, since typically the first and second derivatives of the 
interpolating function have to be taken, these choices inevitably do end up having a big impact on 
derivative prices. 

The concept of entropy provides a clear criterion of how to deal with this problem in the most unbiased 
way possible. Making hard to justify ad-hoc assumptions becomes unnecessary, and it is therefore no 
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wonder that this approach is becoming more and more popular in the financial literature (see [3] , [3] , [7] , 

0, m, m, m, m, m)- 

In this paper, we continue our investigation on Maximum Entropy Densities (MEDs) begun in (21j . 
We showed that, under risk-neutral pricing and non arbitrage conditions, there is a unique density 
compatible with market prices of European call and digital options which maximises entropy. This 
density turns out to be the unique piece- wise exponential density which matches the given market prices. 

The first question we had in mind was how the Buchen-Kelly density introduced in ^lOj fit into 
this framework. The Buchen-Kelly density is obtained by imposing only the European call prices as 
constraints and finding the corresponding entropy maximiser. The result is a continuous piece-wise 
exponential density. 

Fixing the call prices and letting the digital prices vary (within certain bounds given by left and right 
call spreads) leads to a whole family Q of MEDs which all match the call prices. The probability mass 
assigned to each interval between two strikes can vary significantly across MEDs which all meet the same 
constraints imposed by the call prices. 

Our first result is to express the entropy H{g) for g (z G directly in terms of market data (Theorem 
14. ip given by call and digital prices. Since the call prices are fixed, we see if as a function of digital 
prices only. Using this result, we derive a simple formula that expresses the sensitivity of H with respect 
to changes in digital prices (Theorem 14. 2p . We also relate this sensitivity to the continuity of densities 
(Corollary 1331). 

Then, by explicitly calculating the Hessian matrix of H, we show that H has a unique critical point, 
and that the corresponding Buchen-Kelly density is the unique continuous density in Q (Corollarv l5.3p . 

Since we have shown that H is concave with respect to digital prices and calculated the gradient and 
Hessian of H explicitly, it is straightforward to implement a Newton-Raphson algorithm that finds the 
zero of the gradient. In particular, the Hessian matrix is tridiagonal and very easy to invert. By showing 
that H is indeed strongly concave (Proposition 16. 1) ) and explicitly calculating an upper boundary for the 
Hessian's biggest eigenvalue, we are able to give upper boundaries for the difference between the entropy 
of the Buchen-Kelly density and that of the density given by a set of digital prices, and for the Euclidean 
norm of the difference between the Buchen-Kelly digital prices and the given ones (Proposition 16.21) . 
Also, by using the Csiszar-Kullback inequality together with these results, we find an upper bound for 
the L^-distance of a given density in Q from the Buchen-Kelly density (Theorem 16. 3p . We conclude the 
section by illustrating the algorithm with two low-dimensional examples, and by pointing out some of 
its advantages over the original algorithm presented in |10| . 

Finally, we study how the family G becomes more constrained as the set of strikes increases. Given 
the call prices, arbitrage-free digital prices at the same strikes can only move within upper and lower 
boundaries given by left and right call spread prices. As the set of strikes increases, and call prices get 
closer to one another, these bounds become tighter, and we give two examples, one fictitious and one 
with call option data on the S&P, where the densities converge to the Buchen-Kelly density in the sense 
of relative entropy when we use centered call spreads as proxies for digital prices. In the limit, as pointed 
out by Breeden and Litzenberger [6], a continuous set of call prices completely determines the density. 
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2 The MED Obtained from Call Prices - The Buchen-Kelly 
Density 



For a fixed maturity T, assume prices C{Ki) of European call options on a particular asset are observed 
in the market for a set of n strikes < Ki < ... < Kn. Denote by Ci := C{Ki) / DF{T) the undiscounted 
prices of such calls, where DF{T) is the discount factor from today to maturity T. 

For notational convenience, we introduce the "strikes" Kq :— and Kn+i '■— oo. We set Cq to the 
T- forward price of the asset and Cn+i := 0. 

Let 5 be a strictly positive density over [0, oo[ for S{T) - the underlying asset price at time T - that 
under risk-neutral pricing matches the call prices, that is, 

/>oo 

[{S{T) ~ K,)+] ^ {x- K,)g{x)dx ^C,, \/i^O,...,n. (1) 

It is well known that call prices implied by g decrease with strikes. More precisely, [{S{T) — K)^] 
is strictly decreasing on K £ [0, oo[. Therefore, the following non-arbitrage condition holds: 

a >C',+i, Vi = 0,...,n. (2) 

Buchen and Kelly [TD] have shown that, under ([2]), there exists a unique density g on [0,cx)[ that 
maximises entropy and matches call prices ([T]). In the sequel we refer to this MED as the Buchen-Kelly 
density. 

Using a slightly different notation, Buchen and Kelly show that their MED is the unique density of 
the form g{x) — exp (X]r=o '^«(^ ~ -^i)^) which matches the call prices ([T|). They then proceed to 
find the parameters Aq, A„ (/i is a normalization constant which is easily expressed as a function of 
Aq, A„) through a well posed, but sometimes ill conditioned, multi-dimensional root-finding problem. 

We propose a different approach where rather than working on the space of parameters Aq, A„ we 
work on the space of digital prices implied by the densities. As we shall see, the root-finding problems 
we encounter are easier and more stable than the original one that Buchen and Kelly studied. 

Notice that the Buchen-Kelly density is continuous and piecewise-exponential. More precisely, on 
each interval [Ki,Ki+i[, g has the form g{x) = aie^^^ for some constants ai > and f5i G M. (See 
Section O) 



2.1 Implied Digital Prices 

Under risk neutral pricing, the undiscounted price of a digital option with strike K > Q implied by a 
strictly positive density g over [0,oo[ is given by 

/•oo 

[I{S(T)>K}] = / 9{x)dx. (3) 

J K 

It is also well known that [{S{T) — K)^] is strictly convex on K £ [0, oo[. Furthermore, from 
Lebesgue's dominated convergence, implied call and digital prices are related by 
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Therefore, provided that g matches market cah prices ([T]), the impUed digital prices are bounded by 
caU spread prices: 

(Here, the rightmost quantity for i = n must be read as zero.) 

In particular, the digital prices implied by the Buchen-Kelly density are also bounded by these call 
spread prices. Since the Buchen-Kelly density has piecewise-exponential form, the integral in ^ is 
straightforward to calculate analytically [21] . 



3 The MED Obtained from Call and Digital Prices 

If in addition to call prices, for the same maturity and set of strikes, the prices D{Ki) of European 
digitals are observed in the market (e.g. on the S&P 500 index from the CBOE, where they are called 
binary options [14 ), then we set Di :— D{Ki)/ DF{T), for i — 1, ...,n, Dq :— 1 and := 0. 

In this case, from we must have 

In |21| we have shown that under ([5]) there is a unique density g on [0,cxi[ which maximises entropy 
and matches both call prices H]) and digital prices: 

/>oo 

['^{S{T)>K,}] = g{x)dx = D,, Vi = 0, (6) 

"'A', 

In general, g is not continuous but, like the Buchen-Kelly density, it is still piecewise-exponential. 

Keeping the call prices fixed, we now let the digital prices Di, Z3„ vary inside the boundaries 
given in jll). We introduce the set C M" of all D = {Di, ...,i)n) € M" verifying ©. Note that fl is an 
open 71-dimensional rectangle. 

For each I) e we denote by gj~, the MED obtained in [21] . (See also Section 13.11 here for a review 
of some of the main results of [21] ■) Let Q = {g^j \ D E il} he the family of all MEDs obtained in this 
way. 

The Buchen-Kelly density has the greatest entropy among all densities over [0,oo[ matching call 
prices ([T]), and its implied digital prices verify (|3]). In other words, the Buchen-Kelly is the element of Q 
with the greatest entropy. 

In a typical market, where call prices are not available at a large number of strikes, the upper and 
lower boundaries for digital prices given in (H]) can be quite far apart, that is, both fl and G can be quite 
wide. 

However, quoting |10| . "as is often the case in practice, we may have many options trading close-to- 
the-money. Hence, there could be several closely spaced options, with strike differences small compared 
to the current asset price" . In this case, Buchen and Kelly claim that their multi-dimensional root-finding 
problem may be poorly conditioned and become unstable. 
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3.1 How the MED is Obtained from Call and Digital Prices 



We shall recall briefly how each G is obtained from call and digital prices. Firstly, we introduce 
functions cq, ...,c„ which will appear repeatedly in the following: 



In 



c^{^i) ■■= { 



ln(i^,+i - K, 
In ■ 



Their first and second derivatives are given by 



m = 



K,,- 



1 



and 



/3 



for i < n and /3 7^ 0, 
for i < n and P — 0, 
for i — n and /? < 0. 

for i < n and /? 7^ 0, 
for i < n and /? = 0, 
for i = n and /3 < 0. 



(7) 



c'/(/3) = 



12 
1 



for i < n and /3 7^ 0, 

for i < n and /3 = 0, 
for i = n and ^9 < 0. 



Next, we recall the definition and some properties of the Legendre transform of a convex function. 
Let / be a convex function on R. Define the Legendre transform of / as 



f*{y) -.^snpixy - f{x)}, ye 



(8) 



We will frequently use the following properties of differentiable convex functions, which we do not prove 
here. (See [5], [T3], [H] for more details about the Legendre transform.) 

Proposition 3.1 Let f be a differ entiable and strictly convex function on R. Then the following con- 
clusions hold. 

(i). xy < f{x) + f*{y) with equality holding if, and only if, y = /'(x). 
(Hi), f* is convex and [/*]* — f ■ 
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(b) Graph of c\. 
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(c) Graph of [c*]'. (d) Graph of c*. 

Figure 1: Graphs of 4, [c*]' and c* when Ki = 10 and Ki+i = 30. 



The functions ci, ...,c„ are differentiable and strictly convex, and we describe their Legendre trans- 
forms. Figure [1] shows the graphs of q, c-, [c*]' and c* when Ki = 10 and Ki+i = 30. In we 
have shown that is continuously differentiable and strictly increasing, and that c^(/3) goes to Ki and 
Ki+i when j3 goes to +00 and —00, respectively. For i < n, we have c^(0) — {Ki + Ki^i)/2, and from 
Proposition 13.1 l[(ii)}[(iii) | it follows that {Ki + Ki+i)/2 is a root of [c*]' and a minimum of c*. 



From Proposition 13.11 (i) ■ (ii)[ we have the equivalences 



[c:]'{K) = /3 



K = c'm 



c,{P)+c*{K)=pK. 



(9) 



Finally, recall from [21] the following definitions and results. Under no-arbitrage conditions ([SJ, that 
is, for all D € there is a unique density gf, which maximises entropy and matches call ([l} and digital 
(m prices. Moreover, for alH = 0, n we have: 



• The probability of S'(T) being inside a bucket [Ki, Ki-f-i[ is 

rK,+i 

K := FSo[K, < S{T) < K,+i] = / gD{x)dx = A - A+i- 



(10) 
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The conditional expectation of S{T) given that it is inside the bucket is 

ia + K,D,) - + i^.+i A+i) 

Pt Jk, 



Wi>[S{T)\K, < SiT) < K,+{\ = - [ xgj^ix)dx 

Pi Jk, 



(11) 

Furthermore, on [Ki, Ki^i[ the density 5^ has the form gf){x) = aiC^^'^ where 

a, = P^e-'^^^'\ (12) 
cUA) = k,. (13) 

For notational convenience again, in dill) , we make the convention that Kn+iDn+i '■— 0. 

Remark that in order for (fTP)) and pT|) to hold, it is necessary that Pi > i) and Ki < Ki < Ki^i for 
all i = 0, ...,n. These conditions are equivalent to ([5|). 

When computing a; and equation (jl3p is the only one that cannot be solved explicitly, since we 
do not know the expression of [c[]~^ — [c*]'. However, in practice, it can be solved very quickly with a 
one-dimensional Newton-Raphson rootfinder, since we have the derivative c" analytically. Typically, it 
takes about three steps to obtain a very precise result, since the two functions are very well behaved. 

4 Maximum Entropy in Terms of Market Data 

Using the Legendre transform, we now show that for any Z? G the entropy of e can be expressed 
directly in terms of market data with no need to find the piecewise-exponential form of . 

Theorem 4.1 For all D €z Q the entropy of gf^ can be expressed as 

n n 

H{9d) = - ^p.,\np^ - ^piC*{Ki), 

1=0 i=0 

where pi and Ki are given purely in terms of option prices by (|10p and (jlip . for all i = 0, ...,n. 

Proof Let D E il. For each i = Q, ...,n, let pi, Ki, ai and f^i be given by equations (jlOp and ([TT]) . (fT^ 
and (fT3|l . respectively. 



Fix i G {1, ...,ri} and let Hi(gjj) denote the entropy of over the i-th bucket, that is. 



Hiigr))-^- I gQ{x)\xYgfj{x)dx = -lnui / g^{x)dx-l3i i xg^{x)dx. 

JKi JKi JKi 

From (Uni) and it follows that 

Hiigi)) = -Pi(lnai + PiKi). 

Using p^ . we get 

Hi{gf,) = -P,(lnp., - c,(A) + 

Finally, ^ and ® yield 

Hiigi)) = -Pi'^^Pi -PiC*{ki). 
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Adding up over i — 0,...,n yields 



1=0 1=0 1=0 □ 



The expression of H{gjj) given in Theorem 14 . 1 1 can be split into discrete and continuous parts: 

n 

H\gD).= J2Hf{g^), where Hf{g^) := -p^np,, Vi = 0,...,n, (14) 

4=0 

n 

H'{9d) ■■^Y.^Kgb), where H^ig^) := ~p.,c*{K.,), Vj = 0,...,n. (15) 

1=0 

The term H'^igQ) can be seen as a Shannon type discrete entropy, whose maximum is attained when all 
Pi are equal, i.e. when all buckets have equal probability, whereas each H'[{gfy), for i < n, attains its 
maximum when Ki lies exactly in the middle of Ki and iiTi+i, i.e. Ki ~ (Ki + Ki^i)/2. 

4.1 The Sensitivity of Maximum Entropy with Respect to Digital Prices 

With fixed strikes and call prices, under the light of Theorem 14. 11 _ff , H'^ and H"^ can be seen as functions 
of digital prices only. Now we show how H is affected by digital price changes. 

Theorem 4.2 As a Junction of digital prices, H : fl ^ W is differentiable and, for all D ^ fl, we have 

r) M 

-^{b) - ln.g£,(J^,-) - ln5^(if,+), Vz - 1, n, 

where 

gf){Ki-):= lim g^{x) ^ Ui^ie'^'-^'^' and g^{K^+) -.^ lim .g^(a;) = aiC^'^S 
with at and Pi given by ([T^ and (jl3p . respectively, for all i — 0, n. 

Proof Let D G H.. For each i = 0, ...,n, let pi, Ki, cti and f^i be given by equations (|10p and ([TT|) . 
and (jl3p . respectively. 

Fix i e {1, ...,n}. Throughout the proof, we use the following consequences of (fTO|) and (jlip : 

dpi dpi-i dKi^i K.,^i - Ki^i dki Ki - Ki 

1, — = — = — 1, — = and 



db, ' dbi ' ^p^-l pt-i dpi pi 



Starting with the discrete part, from (ITi)) we have 



dpi-i / , \ dpi 



db-^'^^^dbS- ^"^'^^^ " '"^^1 = ( - ^"^^-^ -')^mi- ^"^^ ' 



dD, V / dD., 



lnp,j_i - Inpi. (16) 
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For the continuous part, from (jl5p we have 
'i 

dp, 



dDi dD 



dD 



4{k,)^pM]'{k,) 



K, - K, 



Pi-i J \ Pi 

Usmg ® and 1^ yields [c*^^]'{K^.i) = and [c*]'{K,) = Hence, 

f)TJC _ _ _ _ 

-{b) = c*_i(i^,_i) - - if,) - c*{k,) - A(if, - X,) 



Again, using © we get 



dH 



dD 

Putting (jl6D and (TTfl) together and using (fT^ we obtain 

n rr Tjd c\ rjc 

dDi dDi dDi 

= lna,_i + ft_iif , - In a, - /3,if, = ln(a,_ie^-i^') - ln(a,e'^'^-) 
= ln5^(if,-)-ln5^(if,+). 



(17) 



□ 



Corollary 4.3 For any D Cz fl, the density gf) (z G is continuous if, and only if, D is a critical point of 
H , that is, 

d M 

-~-ib)=0 V*-l,...,n. 

dDi 

Proof This is a direct consequence of Theorem 14.21 and the fact that in the interior of each bucket gjj 
is exponential. □ 



5 The Buchen-Kelly Density is the only Continuous MED 

We know that the Buchen-Kelly density is continuous and has the greatest entropy of all densities in 
G. In agreement with Corollary 14.31 digital prices implied by the Buchen-Kelly density correspond to 
a point of maximum entropy and, thus, a critical point of H in f2. Actually, the digital prices implied 
by the Buchen-Kelly density form the only critical point of H in il. Indeed, H is strictly concave as we 
shall see in Proposition 15.21 

Now we state and prove a lemma which we need for our algorithm and for the proof of Proposition 
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Lemma 5.1 The function : $7 — > M zs twice dijjerentiable and its Hessian matrix at any D G Q is 
symmetric and tridiagonal with entries given by 



dD 



1 1 {K,-K^^if {K^-K{) 



Vi = 1, n, 



dDm,+i P. p^c'lm 



Vi = l,...,n — 1, 



where pi, Ki and j3i are given by (llOp . and p3p . respectively, for all i — 0, ...,n. 

Proof Let I? e il. For each i = 0, let pi, Ki and Pi be given by equations ([T0| . (fTTj) and ([T3|. 
respectively. 

The fact that H"{D) is tridiagonal and symmetric is clear from the expression of iJ's partial deriva- 
tives given in Theorem IT^ Actually, the discrete part [H'^]" {D) and continuous part [H'']"{b) of H"{D) 
are themselves symmetric and tridiagonal. 

Fix i e {1, ...,n}. We begin by calculating the second derivatives of H"^ starting from ([TS]). For the 
diagonal entry we have 



^^^^ "'^"^^ 
and, if i < rt, for the off-diagonal entry we have 



1 dpi-i 1 dpi 



1 1 



Pi-i dD, PtdD, 



Pi-i Pi 



\npi 



1 dp, ^ l_ 
Pi dD,+i Pi 



Next, for the continuous part, starting from (jl7l) we get 



dDf dD, 



dDi dDi 



and, if z < n, 
d^H" 



-{D) = 



d 



dp. 



c,{fi,)-P,K, = [c',{l3,) - K,) 



dD,dD,+i dD,+i 
In the last steps, we used 

{K,-K,^,) dp, {K,-K,) 



P^c'l{P^) 
dp, - K, 



dD, 



P^-lC'U{P^-ly dD, P,C'I{P,) 



and 



P^4{P^ 



(18) 



(19) 



(20) 



(21) 



which are obtained by differentiating equation c[_i{Pi^i) = Ki^i w.r.t. Di and equation c'i{Pi) — Ki 
w.r.t. Di and Di+i- □ 
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Proposition 5.2 The function H : il ^ ^ is strictly concave. 



Proof Let D E il. It is enough to show that H"{D) is strictly negative definite. To accomplish this, 
we shall prove that [H'^]"{iD) is strictly negative definite and [H"]" [D) is negative definite. 

For each i = 0, n, let p^, Ki and Pi be given by equations pUj) . pT|) and ([T^. respectively. 

Recall that for an n x n triangular symmetric matrix A — (flij ) we have 

n n— 1 

(Mrf) ^^o^.^rf^ + Z^aM+i^^^t^m Vd=(di,...,d„)€M". (22) 

i=l i=l 

Apply this relation to A ^ H'^{D) and d e M", with d^Q, and use ([TH]) and ([H]) to get 

,2 n — 1 72 I j2 72 n— 1 , , 



Po 
< 0. 

Now, apply (EH to A = i/=(Z)) and d e M" and use ^ and jH]) to obtain 

[H] {D)d,d)^~i^ v.-.^L^.-^ ^ P.<^m '^^'^ 



p^d'm 



n-1 

i=l 

n-1 



{K, ~ K,)d,{K,+i - K,)d,+i 

p^c"m 



^ jKi^Roldl ^ [{K, - K,)d, + - _ (X„ ~ 

Poc^)'(/3o) P^'^'^^f^^^ Pn<(/3«) 

< 0. 

(Actually, with a small extra effort one gets strict inequality above for d 7^ 0.) □ 

Remark that the last proof implies that H"{D) is invertible for all D E Vl. This is important to 
assure that the Newton-Raphson step for maximizing H is well defined. We will come back to this point 
in Section [6] 



From Proposition 15.21 we immediately obtain the next result. 
Corollary 5.3 The following statements are equivalent: 
(i). gf) £ G is the Buchen-Kelly density. 
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(ii). D € ^ is a critical point of H . 
(Hi), gjj ^ Q is continuous. 



Proof We already knew that (i) (ii) <^ (iii) and the last proposition fills the gap (ii) =4> (i) □ 



5.1 Some Remarks Regarding the Buchen-Kelly Density 

We conclude this section by giving some simple but useful formulas for expressing the Buchen-Kelly 
density in terms of our a^'s and ft's, and vice versa. The analytical formulas from Section 2.3 in ^2T| 
can then be applied to densities in Buchen-Kelly form. 

Buchen and Kelly label the calls Ci observed at strikes Ki in the market from i ~ l,...,m. The 
density in [10] is explicitly given there in equation (8), for Ci{x) = [x — Ki)^ , by 



if™ \ r°° ( ™ 

:i{x) = -ex^piy^Xiix - K.j)+ \ , A* = / exp 

^ \^=l J \^=l 



{x~K,)+\dx. (24) 



For X € [Kj, KjJ^.l[, we can write this as 



g{x) = - exp [ - ^ \Ki j exp [ ^ A^ 



Assume Ki — 0. Then we obtain the following conversion formulas: 

ao = -, ai = -e-^^^% a,„_i = ^e" 5:r=2 ^.i^. ^ 

rn 

/3o = Ai, I3i = \i + X2, Pm-i = Aj. 

i=l 

Obviously only a continuous density can be written in the Buchen-Kelly form. For such a density, 
we have 

Al = I3q, A2 = /?! — /3o, Am = /?m-l " Pm-2, 

1 

M = • 

ao 



6 An Algorithm to Find the Buchen-Kelly Density 

CoroUarv 15.31 savs that if _D e il is a root of H' , then 5^ is the Buchen-Kelly density. Since the Hessian 
matrix of H is known analytically, the Newton- Raphson method is a possible choice of algorithm to find 
the root of H' numerically, and consequently the Buchen-Kelly density. 

The choice of the Newton-Raphson method for root-finding problems is not without its concerns. 
Nevertheless, in this section we shall see that for our particular problem these concerns vanish if we 
combine a pure Newton-Raphson method with an initial phase of backtracking line search. This is the 
damped Newton method presented in [5], page 487. 
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First of all, there is the issue of finding the wrong root when there is more than one. Of course, this 
cannot happen here since uniqueness holds. 

Then there is the high cost of computing and storing the, normally, entries of the Hessian matrix. 
In our case, this matrix is symmetric and tridiagonal (Lemma 15. ip and, thus, the cost applies only to 
2n — 1 entries. 

Each step of the Newton- Raphson method for finding the root of H' entails solving, for d £ M", 
the linear system H"{D)d = H'{D), where D G is the current guess for the solution. Therefore, the 
method fails if H"{D) is not invertible. However, this cannot happen since we have {H"{D)d,d) < if 
d^O (Proposition [63I). 

Again, solving the n x n linear system above might be very costly in general. Here, since H'{D) is 
tridiagonal, the system can be efficiently solved in C(n) operations using, for example the routine tridag 
from [23]. Regarding the stability of this routine, we quote [SJ, page 57: 

The tridiagonal algorithm is the rare case of an algorithm that, in practice, is more robust 
than theory says it should be. 

Instability of the linear system is then unlikely but if it does occur, we still have room for improvement. 
There is a well known link between instability and H" {D) having a large condition number. The latter 
is connected to the eccentricity of level curves of H . Hence the geometry of the n-dimensional rectangle 
f2 plays an important role. By making f2 less eccentric through affine transformations, one can decrease 
the condition number of H"{D), thus improving stability of the linear system. This does not affect the 
Newton- Raphson algorithm since it is invariant by afHne transformations [S]. 

It is worth mentioning that linear tridiagonal systems arise in a broad range of problems. For this 
reason they have been extensively studied and several algorithms (including parallel ones) have been 
proposed [ISl, [IS], HO]- 

The last concern is that a pure Newton-Raphson method does not guarantee that each guess stays 
in ri, the domain of H. In combination with the backtracking line search this issue becomes just an 
implementation detail as explained in p], page 465. 

The remainder of this section presents results that allow us to follow the convergence analysis of the 
algorithm presented in 

The next result strengthens Proposition 15.21 
Proposition 6.1 The function H : Q ^ R is strongly concave, that is, there exists an to > such that 

^H"{D)d,d^ < ~m\\df, yDen and Vd G M". 
Furthermore, the constant m can be taken as 4sin^(7r/(2n + 2)). 

Proof In the proof of Proposition 15.21 we showed that both H'^ and H'^ satisfy a relationship similar to 
the one we aim to prove but with to = 0. Strengthening this relationship for H'^ to m > is enough to 
finish the proof. 
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Let D E and d E M". From fP5|) . and using that < 1 for all z = 0, n, we get 

n— 1 n—1 n—1 

'[HXimd) < -d\ - ^(rf, - - di = -di - Y^idj + dj^,) + 2^ - di 

2—1 i—1 i—1 

n n — 1 

= -2 ^ d2 ^ 2 ^ = - {Ad, d) , 



where ^ is the nx n tridiagonal symmetric matrix with 2's in the diagonal and — I's in the off-diagonals. 
This is a well known matrix that arises in the finite difl[erence discretisation of the heat equation. Its 
eigenvalues are 4 sin^(fc7r/(2n + 2)) for k = l,...,n [53]. Therefore the result holds if we take m — 
4sin^(7r/(2n + 2)), the smallest eigenvalue of A. □ 

Since H is strongly concave, we can apply the arguments of [S], Section 9.1.2, which we repeat in the 
next proposition for the sake of completeness. This proposition gives estimates on how far an element 
D G Q is from the Buchen-Kelly digitals and how much H(D) is below the maximum. Both estimates are 
given in terms of Since H'{D) is easily computed (Proposition 14. 2p at each Newton- Raphson 

step, these estimates can be used to define a stopping criterion. 

Proposition 6.2 Let D Cz Q be such that gjy (z G is the Buchen-Kelly density and let m > be as in 
Provosition \6. 11 Then, for all D ^ Q we have 

H(D)-H(D)<-^\\H'(D)f and \\D - D\\ < —\\H' (D)\\. 

2m m 



Proof Let D ft and set d — D — D. The second-order Taylor expansion of H around D gives 9 ^ (0,1) 
such that 

H{D)~H{b) ^ (H'{b),d^ + ^ (H"{b + ed)d,d^ < (^H'ib),d'j ~ ylldlp. 

Seen as a function of d G K", the rightmost term above is a concave quadratic function which reaches 
its maximum at m^^H' {D). Therefore, substituting d by m~^H' {D) yields our first result. 

By the Cauchy-Schwarz inequality we have (^H' {D), d^ < . Therefore, 

H{b) H{b) < {H'iD),d) < ||d|| {\\H'iD)\\ y ||d||) . 

Since H{D) — H{D) > 0, so is the righthand side of the expression above, which implies the second 
result. □ 

We also have an estimate for the distance (in the sense of the L^{Q, oo) norm) between gj^ £ Q and 
the Buchen-Kelly density. First we need to measure this distance in terms of the relative entropy. 

For two strictly positive probability densities / and g over [0, oo), the relative entropy or I-divergence 
of / with respect to g is given by 

/(/II.9)- r f{x)\nl^dx. 

Jo g(x) 

The relative entropy of / with respect to g is, in some ways, similar to a measure of the "distance" 
between these two densities [TI]. For instance, I{f\\g) > with equality holding if, and only if, f — g. 
However, the relative entropy is not a metric. 
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Theorem 6.3 Let g^^^Qhe the Buchen- Kelly density. For all D G we have 



\\g^^g^\\^^< \\H'{D 



Proof Recall that the Buchen-Kelly density has the form gf){x) — fJ. ^ exp (X^iLo " ^i)^) for all 
X > and some constants fi > 0, Aq, A„. For any D £ CI we have 

/ g fyln g jjdx — ~ j 5^ In /idx + Ai / [x — Kj)^ g^dx = — In/i + XjCj. 

In particular, the integral above does not depend on D. Moreover, taking D — D gives that the integral 
is equal to —H{D). Therefore, 

/•OO _ POO POO 

H9d\\9d)^ 9b^^—dx^ I gf^lng^dx- gfjlng^dx^H{D)-H{D). 
Jo 9d Jo Jo 



From the Csiszar-Kullback inequality [T] and Proposition 16.21 we get 



1.96 - 9D\\h < 2I{9d\\9d) - 2 [H{D) - H{D)) < -\\H'{D)\\\ 



and the result follows. 



□ 



6.1 Two Low-Dimensional Examples 

To see that usually we are dealing with a very smooth problem, consider the case where the forward 
F = Cf) and a call price Ci at a strike Ki are given. The digital Di (at the same strike Ki) can vary 
between the bounds imposed by ([S]). 





(a) Graph of H{Di). 



(b) Graph oi H{Di,D2). 



Figure 2: Graphs of H{Di) and H{Di,D2). 



In Figure 2(a) we have plotted the graph of H for 

F = 100.00, 
Ki = 100.00, Ci = 9.9477. 
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By © we must have Di e n =]0, 0.9005[. 

H attains its maximum of 4.6801 at Di = 0.4962. If we think of prices being given by the Black- 
Scholes formula with r = 0, T = 1 and a = 0.25, then the Black-Scholes price of the digital is = 
0.4503. Note that over most of the interval fi, H' is basically a linear function, so that starting with a 
decent guess for Di leads almost immediately to the solution. For example, if we start with the middle 
digital Z)^" — 0.4503 (which, curiously, is almost, but not quite, identical to Df^), the algorithm takes 
just 2 steps to find the root to a tolerance of 10~®. 

In Figure [2(b)] we have plotted H{Di,D2) for 

F = 100.00, 
Ki = 80.00, Ci = 22.2656, 
K2 = 120.00, C2 = 3.7059. 

In this set up we have fl =]0.4640, 0.9717[x]0, 0.4640[. Again, starting with middle digital prices D\'^ = 
0.7178, = 0.2320 leads to the maximum i7(0.7884, 0.1991) = 4.6208 in 3 steps. 

6.2 Some Differences to Buchen and Kelly's Algorithm 

It's easy to suggest a starting point for the root search, for example the middle between the left and 
right hand sides of equation ([S]), or alternatively a single call spread as in equation I^E^, which amounts 
to the same thing if the spacing between the strikes is constant. 

Note that compared to the Buchen-Kelly algorithm: 

• We start with a density that already matches all given call prices, and the root-search takes place 
in the family Q of densities matching the call prices. Therefore, at each step in the algorithm, the 
occurring density has a precise financial interpretation. 

• The Hessian matrix H" is tridiagonal, so that inverting it is extremely easy and fast. In [10], the 
corresponding matrix is given as the covariance matrix between the calls at the different strikes, 
and all elements will usually be non-zero. 

• The dimension of the root-finding problem considered here is one less than in [TU] . For example, 
given the forward F = Cq and a call Ci as in the first scenario above, this is a one-dimensional 
problem, not two-dimensional. 

7 Comparing Densities for an Increasing Number of Strikes 

When call prices are available at a large number of strikes, then the upper and lower boundaries given 
by left and right call spreads, respectively, define rather tight intervals for arbitrage-free digital prices. 
Of course, in the limiting case where call prices are available at all strikes, they completely determine 
the density, as is well known from [5], and the digital prices are uniquely determined. 

When call prices are available at only a small number of strikes, the digital prices can be from rather 
large intervals, and densities matching the call prices can differ quite substantially. However, as the 
number of strikes increases, these differences become smaller and smaller. We measure these "distances" 
between the densities using relative entropy. Note that by the Csiszar-KuUback inequality [T] convergence 
in terms of relative entropy implies L^-convergence. 
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For two piecewise exponential densities, the relative entropy can be easily calculated. If, for x G 

[Ki^Ki^i[^ we have g{x) — aiC^^^ and the prior p{x) = a^e'^*^, then the relative entropy is given by 

: , . (25) 



Of course, from the proof of Theorem l6.3[ we know that in case p is the Buchen-Kelly density and g 
a density matching the same call prices, then = -ff (p) — -ff (.9) holds. 

7.1 A Black-Scholes Example without Volatility Skew 

In this first example, assume a flat Black-Scholes market given by the data 

F = 100.00, r = 0, a = 0.25 and T = 1, 
where all call option prices are calculated with the Black-Scholes formula. 

First, we calculate the Buchen-Kelly density in four cases, when 3, 5, 9 and 17 strikes are given, 
respectively, with spacing decreasing from 40, 20, 10 to 5. 

Second, we calculate the entropy maximising density from |21| using digital prices obtained via 
centered call spreads (CCS) 

^r:-- g^^'5"\ ^ = 2,...,r^-l. (26) 

At left and right endpoints {i — l,n), we don't apply this formula and use the Buchen-Kelly digital 
prices instead. 

Third, we calculate the entropy maximising density from |21) using Black-Scholes digital prices ob- 
tained from the formula 

j^BS ^ \Ak}_^ j, z=.l,...,n, (27) 

where A'^ is the standard normal cumulative distribution function. 

Digital prices and entropy of the different densities are reported in Table [T] 

We also calculate the relative entropies with respect to the first density. It can be seen that, as 
the number of strikes increases, the approximations given by either (|26l) or (l?fl) become closer to the 
Buchen-Kelly digital prices and the relative entropy, obtained using (^5]) with the Buchen-Kelly density 
as the prior density, decreases. 

Note also that in this flat market scenario, the digital prices given by the Black-Scholes formula are 
practically the same as the Buchen-Kelly prices if five strikes or more are calibrated to. 

7.2 An Example with Call Option Prices on the S&P 500 

In the second example, the market is given by CBOE option prices from 10 April 2010 with maturity 18 
December 2010 and a forward F = 1178.00. 
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Table 1: A Flat Black-Scholes World 









K 


60 


65 


70 


75 


80 


85 


90 


95 


100 


105 


110 


115 


120 


125 


130 


135 


140 


n 


Entropy 


Rel. Ent. 


C 


40.145 


35.346 


30.719 


26.336 


22.266 


18.565 


15.272 


12.401 


9.948 


7.889 


6.190 


4.811 


3.706 


2.832 


2.149 


1.620 


1.214 


3 


4.616 


0.000 


BK D 


0.967 
















0.465 
















0.070 


5 


4.608 


0.000 




0.973 








0.779 








0.451 








0.197 








0.070 


9 


4.607 


0.000 




0.974 




0.903 




0.779 




0.617 




0.450 




0.306 




0.196 




0.120 




0.070 


17 


4.607 


0.000 




0.973 


0.945 


0.904 


0.847 


0.779 


0.700 


0.617 


0.532 


0.450 


0.374 


0.306 


0.247 


0.196 


0.154 


0.120 


0.093 


0.070 


3 


4.613 


0.003 


CCS D 


0.967 
















0.487 
















0.070 


4 


4.587 


0.021 




0.973 








0.755 








0.464 








0.218 








0.070 


9 


4.596 


0.011 




0.974 




0.894 




0.772 




0.616 




0.454 




0.312 




0.202 




0.125 




0.070 


17 


4.604 


0.004 




0.973 


0.943 


0.901 


0.845 


0.777 


0.699 


0.616 


0.532 


0.451 


0.376 


0.308 


0.248 


0.198 


0.156 


0.121 


0.093 


0.070 


3 


4.614 


0.002 


BS D 


0.972 
















0.450 
















0.071 


5 


4.608 


0.000 




0.972 








0.779 








0.450 








0.196 








0.071 


9 


4.607 


0.000 




0.972 




0.903 




0.779 




0.617 




0.450 




0.306 




0.196 




0.120 




0.071 


17 


4.607 


0.000 




0.972 


0.945 


0.903 


0.847 


0.779 


0.700 


0.617 


0.532 


0.450 


0.374 


0.306 


0.247 


0.196 


0.154 


0.120 


0.093 


0.071 



First, we calculate the Buchen-Kelly density in four cases, when 3, 5, 9 and 17 strikes are given, 
respectively, with spacing decreasing from USD 200, 100, 50 to 25. 

Second, we calculate the entropy maximising density from \TT\ using digital prices obtained from the 
approximation (1261) . Again, at left and right endpoints we use the Buchen-Kelly prices. 

Digital prices and entropy of the two densities are reported in Table [5J 

We also calculate the relative entropy of the second density with respect to the first density. 

Again, it can be seen that, as the number of strikes increases, the approximations given by (j26p 
become closer to the Buchen-Kelly digital prices, and the relative entropy obtained using (j25p with the 
Buchen-Kelly density as the prior density decreases. 

Note that in both examples given in 17.11 and 17. 2[ we can observe the pattern that in-the-money 
{Ki < F) CCS digital price estimates tend to be too low and out-of-the-money {Ki > F) ones too 
high compared to the digital prices obtained with the Buchen-Kelly density. One could therefore try to 
establish better initial guesses, but we will not pursue this here. 

8 Conclusion 

In this article we study the family Q of piecewise exponential densities matching given call option prices 
with different strikes and for a fixed maturity T. Considerations of entropy maximisation lead directly 
to this particular form, which has the advantages of being easy to work with (distribution functions 
can be calculated and inverted analytically, for example) and generating very realistic looking "volatility 
smiles" . 

We make clear that despite the constraints imposed by the call option prices, the probability assigned 
by different densities in Q for the underlying asset to lie in a given interval can vary considerably. This 
will obviously have a significant impact when pricing just about any type of derivative, beginning with 
digital options. However, the more such call option prices are observable, the smaller the scope for these 
variations will become. So, on the one hand, when only the forward price of the asset and an at-the- 
money call option price are known, the probability of, for example, the asset finishing in-the-money will 
change substantially, depending on the density chosen from Q. On the other hand, in the theoretical 
case where call option prices are available at all strikes, these will be enough to completely determine 
the density (the family will consist of just this one member), and therefore also the probability of the 
asset finishing in-the-money. 

Giving digital option prices at the same strikes as for the calls uniquely determines a density in the 
family Q, and we therefore think of it as being parameterised by these digital prices. An analysis of the 
entropy function H over this family allows us to show that there is a unique density maximising H, the 
Buchen-Kelly density, which also is the only continuous density in Q. 

Based on these results we introduce a Newton- Raphson algorithm for finding the Buchen-Kelly density 
that, in our opinion, is simpler than the one originally proposed, and we list some of its advantages. Using 
the gradient of expressed as a function of digital prices, we give three convergence criteria for the 
algorithm. 

Finally, in two market scenarios, one fictitious and the other with market data from the CBOE, we 
illustrate how the densities matching these call prices converge to the Buchen-Kelly density when prices 
at more and more strikes are given. 



19 



Table 2: An Example with Call Option Prices on the S&P 500 



to 
o 







K 




1000 


1025 


1050 


1075 


1100 


1125 


1150 


1175 


1200 


1225 


1250 


1275 


1300 


1325 


1350 


1375 


1400 


n 


Entropy 


Rel. Ent. 


C 


207.919 


186.796 


166.526 


146.958 


128.294 


110.583 


93.925 


78.522 


64.473 


51.879 


40.791 


31.308 


23.431 


17.059 


12.042 


8.228 


5.168 


3 


6,6363 


0,0000 


BK D 


0,843 
















0,530 
















0.095 




6,6345 


0,0000 




0,846 








0,732 








0,532 








0,289 








0.091 


7 


6.6325 


0.0000 




0,851 




0.800 




0.729 




0.642 




0,534 




0,411 




0.283 




0,180 




0.095 


17 


6.6234 


0.0000 




0.857 


0.829 


0.797 


0.766 


0.728 


0.689 


0.642 


0.590 


0.533 


0.474 


0.412 


0.347 


0.284 


0.227 


0.173 


0.137 


0.104 


3 


6.6314 


0.0049 


CSS D 


0.843 
















0.507 
















0.095 


5 


6.6266 


0.0079 




0.846 








0.717 








0.524 








0.297 








0.091 


7 


6.6288 


0.0037 




0.851 




0.796 




0.726 




0.638 




0.531 




0.410 




0.287 




0.183 




0.095 


17 


6.6217 


0.0017 




0.857 


0.828 


0.797 


0.765 


0.728 


0.687 


0.641 


0.589 


0.533 


0.474 


0.411 


0.347 


0.285 


0.228 


0.177 


0.137 


0.104 
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